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Abstract. We generalize the Davenport transform and use it to prove that, for a 
(possibly non-commutative) cancellative semigroup A = {A, +) and non-empty finite 
subsets X,Y of A such that the smallest subsemigroup generated by Y is commutative, 
one has IX -I- ^1 > n{X, Y), where 

Q{X, Y) := min ( |A:| + |F| - 1, sup min ord(y - yo) ] . 

The result extends the Cauchy-Davenport theorem to the broader setting of semi- 
groups. Also, it strengthens a previous generalization by G. Karolyi relating to sum- 
sets in a commutative group, for which f2(JY", Y) in the above inequality is replaced 
with min(p(A), \X\ -\- \Y\ — 1), where p(A) is the order of the smallest non-trivial sub- 
group. While Karolyi's bound is never better than the one provided by Q{X,Y), we 
show that it is in fact much worse in significant cases. Moreover, we prove that our re- 
sult implies, as a rather immediate corollary, an extension of I. Chowla's generalization 
of the Cauchy-Davenport theorem to arbitrary cyclic groups. 



1. Introduction 

The present paper is focused on the additive theory of semigroups, in continuation to 
the work initiated in [21)], which the reader is recommended to consuh for (possibly 
non-standard) notation and terminology employed here without explanation. For the 
sake of generality, some definitions and results will, however, be phrased in the more 
abstract language of magmas, convinced as I am that theorems and theories are better 
understood (and possibly generalized) in absence of what may be referred to as "con- 
ceptual redundancies". In this respect, let me remark from the outset, for any practical 
intents and purposes, that all magmas (and hence semigroups, monoids and groups) 
considered in the sequel, unless differently specified, are written in (standard) additive 
notation but nonetheless are not commutative. Also, the axiom of choice (shortly, AC) 
is assumed. 

With this in mind, let A be a magma, that is a pair {A, +) consisting of a (possibly 
empty) set A and a binary operation + : A x A A. Given two subsets X, Y of A, we 
take 

X + Y := {x + y: X e X,y eY}, X - Y := {z e A : z + y e X for some y € Y}. 
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Note that X + Y = X — Y = (}>ii one of X or y is empty. In particular, we write X + y 
in place of X + y and X — y instead of X — Y whenever Y = {y}. One calls X + Y 
and X — Y, respectively, the sum-set and the difference-set of the pair {X,Y), and then 
defines —Y-\-X as the difference set of {X, Y) in the dual, A°p := (A, +op), of A, where 
+op is the operation ^4 x A ^ A : (^i, 2^2) ^ Z2 + zi. 

We say that A is unital if there exists a distinguished element Oa S A such that 
z + Oa = Oa + z = z for all z; when this is the case, Oa is unique and called the identity 
of A. Then, we let A^ be the set of units of A, with the convention that A^ := if 
A is not unital; if A is unital with identity 0^^, a unit of A is an element z € A for 
which there exists z € A, provably unique and called the inverse of z in A, such that 
z -\- z = z -\- z = Oa- In addition, for Z C A we write {Z)^^^ for the smallest submagma of 
A containing Z and Ca(^) for the center of Z in A. Lastly, given z G A we use ordA(z) 
for the order of z in A, that is ordA(2) := |(-z)a|- The subscript 'A' is omitted from the 
notation whenever A is implied from the context. 

Sum-sets in (mostly commutative) groups have been intensively investigated for sev- 
eral years (see [27] for a recent survey), and interesting results have been also obtained in 
the case of commutative monoids [12]. The present paper aims to be a further contribute 
to this popular (and very active) research area, in the direction of extending parts of 
the theory to the more general setting of semigroups (and magmas), motivated by the 
apparently reasonable expectation that a further level of abstraction might provide huge 
benefits. 

Historically, the first non-trivial achievement in the study of sum-sets is probably 
the Cauchy-Davenport theorem, originally established by A. L. Cauchy [2] in 1813, and 
independently rediscovered by H. Davenport [6, 7] more than a century later. Stated 
in the wording of group theory (rather than in terms of residue classes, as in the early 
formulations of Cauchy and Davenport), the theorem reads as follows: 

Theorem 1 (The Cauchy-Davenport theorem). Let A be a group of prime order p and 
X,Y non-empty subsets of A. Then, \X -\-Y\ > mm{p, \X\ -\- \ Y\ — 1). 

The result has been the subject of many papers and much speculation, and has 
received a number of different proofs, favoring various points of view and eventually 
leading to significant progress on analogous questions, as in the remarkable case of the 
Alon- Tarsi's polynomial method (see [1] and references therein). 

The Cauchy-Davenport theorem applies especially to the additive group of the inte- 
gers modulo a prime. An extension to composite moduli has been very recently given 
by M. R. Murty and J. P. Whang [21, Theorem 5], based on work by T. C. Tao [28], 
using tools from Fourier analysis on commutative finite groups. More contributions in 
the same spirit were previously provided by I. Chowla [4], J. G. van der Corput [0], S. S. 
Pillai [24], and J. M. Pollard [25]; a partial account of these early results can be found 
in [22, §2.3], along with an entire chapter dedicated to the celebrated Kneser's theorem 
[22, Chapter 4], which implies at once, among the other things, both Theorem 1 and 
the main result in [4]. 

Generalizations of a somewhat different flavor have been furnished, still in recent 
years, by several authors. To be more specific, let Oa be the identity of a unitization. 
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A(i), of a magma A (as essentially defined by J. M. Howie in [f6, p. 2], although in the 
case of semigroups), and denote by p(A) the cardinality of the smallest submagma of 
generated by 2 as z ranges in 

^^^^ \ {Oa}, with the convention that p{A) := |N| if 
A*-^-* is trivial, i.e. A^^^ = {0^^}. Then we have the following theorem, due to G. Karolyi 
(cf. [17, Theorem 13]): 

Theorem 2 (Karolyi's theorem for commutative groups). Let A be a commutative 
group and X,Y non-empty subsets of A. Then, \X + Y\ > min(p(A), \X\ + \Y\ — 1). 

Karolyi's proof can be classified as a "transformation proof", and is conceptually 
similar to other transformation proofs so far invented to deal with the Cauchy-Davenport 
theorem and related problems in the additive theory of groups; see [10] for details. 

While Theorem 2 applies to both finite and infinite commutative groups, a like result 
is known to hold for all finite (commutative and non-commutative) groups: 

Theorem 3 (Karolyi's theorem for finite groups). Let A be a finite group and X,Y 
non-empty subsets of A. Then, \X + Y\ > min(p(A), \X\ + \Y\ — 1). 

This was first proved by Karolyi himself [IS], based on the structure theory of group 
extensions, by reduction to the case of finite solvable groups in the light of the celebrated 
Feit-Thompson theorem [11]. 

One consideration that immediately arises is that, given a finite commutative group 
A and non-empty subsets X, Y of A, the bound for the size of |X + y | provided by the 
above Theorems 2 and 3 is far too pessimistic in most situations, as is easily seen, for 
instance, in the limit case where X = Y = A and p{A) is somewhat small with respect 
to the order of A. 

The issue is basically that Theorems 2 and 3 involve a structural "global" property 
of A, which is essentially "extraneous" to the pair {X,Y). Thus, if one really wants to 
improve Karolyi's results further, a good idea may be to replace p{A) with something 
"local", i.e. more tightly linked to X and Y, which has precisely been the fundamental 
insight at the origin of this work. Before coming to the main achievements of the 
paper, we must however recall further results from the literature that are significant 
with respect to the contents of this manuscript. 

The first result is due to J. H. B. Kemperman [1!)], and deals with torsion- free groups. 
As with the others above, we report the statement here for the sake of exposition: 

Theorem 4 (Kemperman's inequality for torsion-free groups). Let A be a group, and 
let X, Y be non-empty subsets of A. Suppose that every non-zero element of A has order 
> \X\ + \Y\ - 1. Then, \X + Y\> \X\ + \Y\ - 1. 

The proof of this, which appears in [1!)] as a corollary of Theorem 5, proceeds by 
cleverly iterating what is now sometimes referred to as the Kemperman transform (see, 
e.g., [10, §2]). It is worth to mention that [19] is, in fact, focused on cancellative semi- 
groups (there simply called semigroups), and it is precisely in this abstract framework 
that Kemperman establishes a series of results, mostly concerned with the number of 
different representations of an element in a sum-set, eventually leading to a proof of 
Theorem 4. 
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Kemperman's inequality boils down to an instance of Theorem 2 in the special case of 
commutative groups. On another hand, Theorem 4 also represents a major generaliza- 
tion of the following folklore result (whose origins are hard to trace): Suppose X and Y 
are non-empty subsets of Z (the ordered ring of integers) of sizes k and i, respectively, 
and let xi, a;2, . . . , Xfc be a numbering of X and yi,y2, ■ ■ ■ ,yi a numbering of Y. Without 
loss of generality, one can assume, as we do, xi < X2 < ■ ■ ■ < Xk and yi < y2 < • • • < Vi- 
Then, 

xi + yi < X2 + yi < ■ ■ ■ < Xk + yi < Xk + y2 < ■ ■ ■ < Xk + yi, 
with the result that > k + i — 1. Interestingly, these same considerations go 

through almost verbatim in the case of linearly orderable magmas (see [29, Proposition 
3.2]), apart from minor complications due to the introduction of parenthesizations (see 
Definition 2): 

Proposition 1.1. Suppose A = {A, ■) is a linearly ordered magma, written multiplica- 
tively. Pick an integer n > 1, and let Xi, X2, ■ ■ ■ , Xn be non-empty subsets of A and 
ri := \Xi\. Then, for any central parenthesization P of A of length n, one has that 

n 

(1) |(XiX2---X„)p| > l-n + J^r^. 

Furthermore, (1) is sharp whenever A is a semigroup and Xi , X2 , • • • , Xn are finite, the 
lower bound being attained by taking, for instance, Xi = {x^ : k € N''",A; < r^} for each 
i, where x is a suitable element of A (not dependent on i). 

As for the rest, S. Eliahou and M. Kervaire [9, Theorem 2.1], improving an earlier 
result of S. Yuzvinsky linked to the Hurwitz problem in topology [-32], have established 
a variant of Theorem 1 for sum-sets in vector spaces over finite fields. Also, a Cauchy- 
Davenport theorem for acyclic unital semigroups has been given by the late Y. O. 
Hamidoune and coauthors [3, Theorem 3], while a graph-theoretic analogue has been 
recently proved by P. Hegarty [15]. 

In fact, we are not aware of more results that can be properly regarded as "natural" 
variants or generalizations of the Cauchy-Davenport theorem, and indeed of no previous 
work concerned with an extension of the theorem to the setting of arbitrary semigroups. 

Thus, with this background in mind, we can finally proceed to state the main result 
of the paper, but first we need the following definition. 

Definition 1. Given a magma A = {A, +) and a subset Z of A, we let 

(2) '^{Z) '■= sup min ord(2; — zq). 

Then, for A:,y C A we define ^a{X,Y) := if X = or Y = 0; n^iX^Y) := 
max(|X|, jyj) if (at least) one of X or y is infinite, and Q.ii^{X,Y) := min(a;(y), |X| + 
|y| — 1) otherwise. We call Q.p^{X^Y) the Cauchy-Davenport constant of{X,Y) relative 
to A, and write it simply as 0,{X, Y) when A is understood from the context. 

Remark 1. We assume here that the supremum and maximum of the empty set are 
0, while its infimum and minimum are 00 (by which we mean, as usual, something 
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larger than any given cardinal). Thus, since a supremum over a non-empty finite set 
of cardinal numbers is actually a maximum, we can obviously replace sup with max in 
(2) whenever 1 < \Z^ \ < oo. Perhaps more interestingly, something similar applies to 
min and inf, which looks intriguing, or at least suggestive, insomuch as it relates, as we 
are going to see, the problem of sizing a sum-set to a sup-inf condition. And we cannot 
really refrain from wondering whether there is something more to disclose behind the 
curtain in this respect. 

Every pair (X, Y) of subsets of a given magma A has a well-defined Cauchy-Davenport 
constant, which is zero (essentially by definition) if = 0. However, this is not the 
case, e.g., when A is a group and Y is non-empty, to the effect that ^1{X,Y) can be 
used for a non-trivial lower bound on the size of X + y as in the next: 

Theorem 5. Let A be a cancellative semigroup and X,Y subsets of A such that (1^)a 
is commutative. Then \X + Y\ > i}{X,Y). 

Theorem 5 represents the most relevant contribution of the paper: Not only it extends 
Theorem 1 to the more general and abstract setting of semigroups (see Section 5). It also 
provides a strengthening of Theorem 2, in that, given a cancellative unital semigroup 
A = {A, +) and a subset Z of A with ^ 0, it is found (see Lemma 5.1) that 

sup min ord(z — zq) > p{A), 

with strict inequality in some significant cases (see Examples 1 and 2). Theorem 5 is 
proved in Section 4. The argument is a delicate refinement of the transformation proof 
originally used by Davenport in [G]. This leads us to define what we call a generalized 
Davenport transform, which might perhaps be considered an interesting by-product in 
its own right. 

Indeed, I am not aware of any previous use of the same technique in a non-commutative 
setting, much less in relation to (cancellative) semigroups. With few exceptions, remark- 
ably including A. G. Vosper's original proof of his own famous theorem on critical pairs 
[30], the "Davenport transform" seems, in fact, to have been more or less forgotten, 
in favor of conceptually similar (but substantially different) "technology" such as the 
Dyson transform [22, p. 42] or the aforementioned Kemperman transform [19]. 

Remark 2. One thing seems worth mentioning before proceeding: While it is true 
that every commutative cancellative semigroup embeds as a subsemigroup into a group, 
this is false in the non-commutative case; see [20] for an explicit example. This serves 
as a fundamental motivation for the present paper, in that it shows that the study of 
sum-sets on cancellative semigroups cannot be systematically reduced, in the absence 
of commutativity, to the case of groups, at least not in any obvious way. 

Based on Theorem 5, we provide two proofs of Chowla's extension of the Cauchy- 
Davenport theorem to arbitrary cyclic groups [4], the second of which comes as a part 
of the following generalization of Chowla's original result: 

Theorem 6. For m € take X and Y to be non-empty subsets of the additive group 
A = ["L/rnL, +, — , Om) of the integers modulo m. Let xi,X2, . . . , be a numbering of 
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X and yi,y2, ■ ■ ■ ,y£ a numbering ofY. Define 

Sx ■= min max gcd(m, Xi — Xj), 6y := min max gcd(m, % — y,) 

l<i<kl<j<k,j=ii J' y l<i<il<j<e,j^i 

(by identifying a residue class with any of its representatives), and set 5 := max{5x, Sy). 
Then 

\X + Y\> mm{5'^m, \X\ + 1F| - 1). 

In particular, \X + Y\ > min(m, |Xj + |y| — 1) if there exists yo & Y (respectively, 
xq X) such that m is prime with y — yo for each y Y \ {yo} (respectively, prime 
with X — xq for each x £ X \ {xq} ). 

Many natural questions arise. In particular, one can ask if it is possible to generalize 
Theorem 5 in such a way to get rid of the assumption that (1")a is commutative. 
Unfortunately, at present, I have no satisfactory answer in this respect. However, the 
question looks interesting, since an affirmative response would provide a comprehensive 
generalization of about all the previous extensions of the Cauchy-Davenport theorem 
recalled in this introduction, and most remarkably of Theorems 3 and 4. 

1.1. Organization. The plan of the paper is as follows. In Section 2 we establish basic 
identities and estimates concerning sum-sets in semigroups and magmas. In Section 3 
we introduce the generalized Davenport transform and prove some of its fundamental 
properties. In Section 4 we prove our main theorem. In Section 5 we provide some 
additional results, including proofs of Theorem 2, Theorem 6 and Chowla's theorem for 
cyclic groups [4]. In Section 6 we use Theorem 5, in combination with Hall's theorem on 
distinct representatives, to gain some more information on the "internal arrangement" 
of sum-sets (by adapting a previous idea of 0. J. R0dseth [20]). Lastly, in Section 7 
we include a couple of examples to show that Theorem 5 can be significatively sharper 
than Theorem 2. 

2. Preliminaries 

This section collects basic properties of sum-sets that will be used later to introduce 
the generalized Davenport transform and prove Theorem 5. Some proofs are direct 
and standard (and thus omitted without further explanation), but we have no explicit 
references to anything similar in the context of semigroups (and magmas), so we include 
the statements here for the sake of exposition. But first we recall the notion of a central 
parenthesization from [20, §2.1]. 

Definition 2. Let A = {A,-k) be a magma. Given n € N"*", we define recursively 
Pi (A) := {id^} and P„+i(A) := Cn+i{^) U 7^„.+l(A), where idyi is the map A ^ A : 
a ^ a and 

(i) £ri+i(A) is the set of all functions A*^"*"^ — >■ A sending, for some / G T'„,(A), a 
(n + l)-tuple (ai, a2, . . . , a„+i) to the product ai ★ /(a2, 03, ... , a„+i). 

(ii) 7^„,+i(A) is the set of all functions A^^-*^ A mapping, for some / € 'P„(A), a 
(n + l)-tuple (ai, 02, . . . , a„+i) to the product /(ai, 02, ... , a„) ★ a„+i. 
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For n G N"*", we refer to an element P of Vni^) as a central parenthetization of A of 
length n, or also a central n-parenthetization of A. Moreover, for ai,a2, . . . , 0,1 € A, we 
write {ai-k • • • * a„)p in place of P(ai, 02, ... , a„) and, whenever Si, S2, . . . , 5„ are 
subsets of A, we let 

(Si * 5*2 ★ • • • * Sn)p ■■= {{Sl ★ S2 ★ • • • ★ Sn)p : Si G 5*1 , S2 S ^2, . . . , S„ G ^n} 

(note that this is empty if 5^ = for some i). When Si = S2 = • • • = Sn ='■ S , we 
use (<S'"')p in place of (Si * ^2 * • • • * Sn)p if A is written multiplicatively, and {nS)p if 
it is written additively. Moreover, if A is a semigroup or n < 2, then we simply write 
Sl -k S2 * • • ■ Sn in place (^i -k S2 * • • • * Sn)p, since in both cases the result does not 
really depend on the choice of P. Similar considerations apply to (oi *a2 * • • • *a„)p, in 
the most obvious way. 

Parenthetization is just a formal way to deal with expressions involving three or more 
operands in a magma whose operation is not associative. Also, note that we are not 
considering here all of the possible parenthesizations of such expressions, but only some 
special class of them (which we refer to as central). Based upon these premises, we 
can proceed with a couple of lemmas, providing trivial identities and estimates (cf. [28, 
Lemma 2.1]). 

Lemma 2.1. Let X,Y,Xi,X2,Yi,Y2 be subsets of a magma A = (A, +) for which 
^1 ^ ^2 oind Yi QY2. Then the following holds: 

(i) X + Y = Y +op X, and hence \X + Y\ = \Y +op X\. 

(ii) Xi + Y CX2 + Y andX + YiCX + Y2. 

(iii) \Xi + Y\ < \X2 + Y\ and \X + Yi\ < |X + y2|. 

In spite of being so trivial. Lemma 2.1 is useful in many situations, for instance to 
prove that a certain property holds for semigroups that are left or right cancellative 
by just proving that it holds in one of the two cases, which allows us to cut by half 
the number and the length of some mathematical statements. In particular, to express 
that something is true by point (i) of Lemma 2.1, we will simply say that it is true "by 
duality" . 

Lemma 2.2. For a left cancellative magma A = {A,+) the following holds: 

(i) \X\ = \z + X\ for every z G A and X C A. 

(ii) IfX,Y are subsets of A and X ^ 0, then \Y\ < \X + Y\. 

(iii) Given an integer n > 1, let Xi,X2, ■ ■ ■ ,Xn be non-empty subsets of A and P a 
central parenthesization of A of length n. One then has that \Xn\ < {{Xi + X2 + 

1- Xn)p\. 

Proof. We prove point (iii). For set X := {Xi + X2 + • • • + Xn)p- The claim is obvious 
if n = 1, while for n = 2 it reduces to point (ii) above. Therefore, let n > 3 and 
suppose that the statement holds true for every k = 1,2, ... ,n — 1. There exists a 
central parenthesization Q of A of length n — 1 such that X = {Xi + • • • + X„_i)q + Xn 
or X = Xi + {X2 + • • • + Xn)Q- Since the sets Xi, X2, . . . , Xn are non-empty, the same 
point (ii) above and the inductive hypothesis then give that |X„| < \X\ in the first case, 
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and \Xn\ < 1(^2 + ■ ■ ■ + ^n)Q| ^ l-'^l in the second. Thus, the conclusion follows (by 
induction) . ■ 

For the subsequent lemma we assume that 0-k := k-0 := for every cardinal number 

Lemma 2.3. Let A = (^4, +) he an arbitrary magma. One has the following: 

(i) If X,Y are subsets of A, then |X + y| < \X\ ■ \Y\. 

(ii) Given an integer n > 1, let Xi, X2, ■ ■ ■ , be subsets of A and P a central 
parenthesization of A of length n. Then \{Xi + X2 + • • • + ^n)p| < nfc=i l^fcl- 

Proof We prove point (ii). For set X := {Xi+X2+- • •+Xn)p- For n = 1 the assertion is 
obvious, while for n = 2 it reduces to point (i) above. As a consequence, assume n > 3 
and let the statement hold true for every k = 1, 2, . . . , n — 1. There exists a central 
parenthesization Q of A of length n — 1 such that X = (Xi + • • • + Xn-i)ci + 

X = Xi + {X2^ \-Xn)Q. It follows from point (ii) that \X\ < |(XiH hXn-i)Q\-\Xn\ 

in the first occurrence and |X| < \Xi \ ■ \ {X2 + • • • + Xn)Q\ in the second. Whatever the 
case may be, the inductive hypothesis then gives that \X\ < YYk=i which implies 
the claim (by induction). ■ 

Remark 3. No matter if the ambient semigroup is cancellative, nothing similar to 
Lemmas 2.2 and 2.3 applies, in general, to difference-sets of type X — Y, to the extent 
that X — Y can be empty even if X and Y are infinite. On another hand, it follows, by 
duality, from point (i) of Lemma 2.2 that, in presence of cancellativity, the cardinality 
of sum-sets of type X -\-Y is preserved under translation, which is a point in common 
with the case of groups, save the fact that one cannot take advantage of this, at least in 
general, to "normalize" either of X or y in such a way as to contain some distinguished 
element of A. 

The next lemma is central to the use of Davenport transforms in our proof of Theorem 

5. 

Lemma 2.4. Let A be a semigroup and X,Y subsets of A. Then, the following condi- 
tions are equivalent to each other: 

(i) X + 2Y QX + Y. 

(ii) X + nY CX + Y for all n G N+. 

(iii) X + {Y)a = X + Y. 

Proof. Points (ii) and (iii) are clearly equivalent, as X + (y)A = U5^i(^ + '^^)i (i) 
is obviously implied by (ii). Thus, we are left to prove that (ii) follows from (i), which 
is immediate, by a routine induction, using the fact that, if X + nY <^ X + Y for some 
n G N+, then X + [n + l)Y = {X + uY) + Y {X + Y) + Y = X + 2Y <Z X + Y . ■ 

On another hand, the following lemma shows that, in reference to Theorem 5, there 
is no loss of generality in assuming that the ambient semigroup is unital. 

Lemma 2.5. Suppose that Ai = (^i,-|-i) and A2 = (^21+2) are magmas, and let 
(j) : Ai ^ A2 be a (magma) monomorphism and X,Y subsets o/Ai. Then \X +1 Y\ = 
\cPiX) +2 <P{Y)\. 
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We close this section with few simple but remarkable properties of units: 

Lemma 2.6. Let A = {A,+) be a unital semigroup with identity 0^, X a subset of A, 
and z a unit of A with inverse z. Then the following holds: 

(i) X - z = X + z and -z + X = z + X. 

(ii) If A is cancellative, then \ — z + X\ = \X — z\ = \X\. 

(iii) If z (z Ca(^) and A is cancellative, then z S Ca{X), and in addition {X — z)a 
and {—z + X)a are commutative if {X)^ is commutative. 

Proof (i) By duality, it suffices to prove that X — z = X + z. But this is trivial from 
the fact that w £ X — z if and only if there exists x G X such that w + z = x, which in 
turn is clearly equivalent to saying that x + z = w + z + z = ui, viz w G X + z. 

(ii) It is straightforward by duality, point (i) of Lemma 2.2 and point (i) above. 

(iii) Suppose z G Ca(-^) and pick x £ X. By cancellativity, one has that x+z = z+x if 
and only ifx = z+x+z, and this condition is certainly verified asz+x+z = z+z-\-x = x 
(from the hypothesis that z is in the center of A). It follows that z G Cp^{X). With 
this in hand, assume that is commutative and let wi,W2 G {X — z)p^. By point (i) 
above, there must exist xi, X2 G X such that Wi = Xi-\- z, which easily implies that 

Wl + W2 = Xl + Z X2 + Z = Xl + X2 + 2z = X2 + Xl + 2z = X2 + Z + Xl Z = W2 + Wl, 

using that z G Ca(X), as we have just proved, and (X)a is commutative. This ulti- 
mately shows that {X — z)a is commutative too, which completes the proof by dual- 
ity. ■ 

Remark 4. There is a notational subtleness here that it may be worth to underline 
before proceeding. Suppose that A is a unital semiring and x, y G A. In principle, 
X — y and —y + x are not elements of A: In fact, they are difference-sets, and no other 
meaningful interpretation is (a priori) possible. However, if y G A^ and y is the inverse 
of y in A, then x — y = {x + y} and —y + x = {y + x} hy point (i) of Lemma 2.6, and 
we are allowed to identify x — y with x + y and —y + x with y + x, which will turn to 
be very useful in various places later on. 

3. The Davenport transform revised 

As mentioned in the introduction, Davenport's proof [6, Statement A] of Theorem 1 
is a transformation proof. Loosely speaking, the idea is to map a pair {X, Y) of non- 
empty subsets of a commutative group A = {A,+,—,Oa) to a new pair {X,Y'), which 
is smaller than {X, Y) in an appropriate sense, and specifically such that 

|y'| < |y|, \X + Y'\ + \Y\ <\X + Y\ + \Y'\. 

One then refers to {X,Y') as a Davenport transform of {X,Y); cf., e.g., [26, §3]. The 
construction implies that X + 2Y ^ X + Y and 0^ G to the effect that \Y\ > 2. 

As broadly expected, many difficulties arise when attempting to adapt the same 
approach to semigroups, and that all the more if these are non-commutative. Even the 
possibility of embedding a semigroup into a monoid does not resolve anything at all, 
since the fundamental problem is that, contrary to the case of groups, cardinality is not 
preserved "under difference". To wit, if A = (^4, +) is a unital semigroup with identity 
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Oa, X is a subset of A, and a is an element of A, then the cardinahties of X, X — a and 
—a + X can be greatly different from each other, even supposing that A is cancellative; 
cf. point (ii) of Lemma 2.6. Thus, unless A is a group in disguise or, more generally, 
embeds as a submonoid into a group, one is not allowed to assume, for instance, that 
0^ G y by picking an arbitrary element i/q gY and by replacing {X, Y) with the pair 
{X + uo, —yo + Y); cf. Remark 3. 

In fact, the primary goal of this section is to show that, in spite of such issues, 
Davenport's original ideas can be successfully extended to the more general setting of 
(cancellative) semigroups, and used to give a proof of Theorem 5. 

To start with, suppose that A = {A, +) is a unital semigroup with identity 0^ and 
let X, Y be subsets of A and mX + 2Y X + Y for some m € N^. For brevity, define 

Z := {mX + 2Y)\{X + Y). 

Our assumptions give Z 7^ 0. Thus, fix 2; € Z, and take G (m — 1)X and G Y 
such that z £ Xz + X + Y + yz, where OX := {0^}. Finally, set 

(3) Yz:={y€Y:zexz+X + Y + y}, :=y\y,. 

We refer to {X,Yz) as a generalized Davenport transform of {X,Y) (relative to z) and, 
based on this notation, proceed on with the next proposition, intentionally organized in a 
list of properties argued starting from various "local" (rather than "global" ) hypotheses, 
to remark differences with the "classical" Davenport transform and highlight which is 
used for which purpose. 

Proposition 3.1. IfYz 7^ 0, then the triple {X,Yz,Yz) satisfies the following conditions: 

(i) Yz and Yz are non-empty disjoint proper subsets ofY, and Yz = Y \ Yz- 

(ii) If A is right cancellative, then {xz + X + Yz) U (z — Yz) ^ Xz + X + Y . 

(iii) // (y)A is commutative, then {xz + X + Yz) fl (z — Yz) = 0. 

(iv) If A is left cancellative, then \z — Yz\ > \Yz\. 

(v) If A is cancellative and {Y)p^ commutative, then \X + Y\ + \Yz\ > \X + Yz\ + \Y\. 

Proof, (i) By construction, yz G Yz, that is Yz and Yz are both non-empty. Also, (3) 
gives that Yz,Yz^Y and y^ n i; = 0, to the effect that Y\Yz = Y\{Y\Yz) = Yz and 

y.,y. cy. 

(ii) Since ^ ^ y by point (i) above, Xz + X + Yz C + X + y by Lemma 2.2. 
On the other hand, w € z — Yz then there exists y € Yz such that z = w + y. But 
y GYz implies by (3) that z = w + y for some w ^ Xz + X + Y , whence w = w hy right 
cancellativity, i.e. w & Xz + X + Y . 

(iii) Assume the contrary and let w G {xz + X + Yz) PI (z — Yz). There then exist 
X G X, yi G yz and y2 G Yz such that w = Xz + x + yi and z = w + y2- Using that (Y)p^ 
is commutative, this gives that z = Xz + x-\-yi-\-y2 = Xz + x -\- y^ + yi, which implies 
that yi G Yz by (3) since Yz,Yz ^ y by point (i). This is however absurd as Yzr\Yz = 0, 
again by point (i). 

(iv) We have from (3) that for each y ^ Yz there exists w ^ Xz + X + Y such that 
z = w + y, which yields that w G z — Yz- On the other hand, since A is left cancellative. 
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it cannot happen that w + yi = w + 1/2 ior some li) € A and distinct yi, 2/2 S Yz- Thus, 
Yz embeds as a set into z — Yz, with the result that \z — Yz\ > \Yz\. 

(v) Since A is cancellative and X ^ (otherwise Z = 0), one has |X + yj > 
max(|X|, |y|) by point (ii) of Lemma 2.1 and point (ii) of Lemma 2.2. This implies the 
claim if Y is infinite, since then either |X + y| > and hence |X + y | + \Yz\ = \X\ = 
\X + Yz\ + \Y\, or \X + Y\ = \Y\, and accordingly \X + Yz\ + \Yz\ = \Y\ = \X + Yz\ + \Y\ 
(note that we are using here the AC). Thus, we are left with the case where Y is finite, 
for which the inclusion-exclusion principle, points (ii)-(iv) above, the same point (ii) of 
Lemma 2.1 and point (i) of Lemma 2.2 entail that 

\X + Y\ = \xz + X + Y\ > \xz+X + Yz\ + \z-Yz\ = \X + Yz\ + \z -Yz\ > |X + y,| + |f;|. 

But Yz = Y\Yz and y^ C y by point (i), so that \X + Y\>\X + Yz\ + \Y\ - \Yz\. ■ 

Remark 5. To apply the generalized Davenport transform to the proof of Theorem 
5, it will be enough to consider the case where m = 1, for which it is easily seen that 
£ Yz Oa & Y (we continue using the notation from above), as in the contrary case 
we would have z € X + Y, contradicting the fact that z G {X + 2Y)\{X + Y). However, 
it seems intriguing that the same machinery can be used, at least in principle, even if 
m > 2 in so far as one has a way to prove that y^ 7^ 0, which is the reason why we 
decided to approach the subject as we have done. 

4. The proof of the main theorem 

Lemma 3.1 accounts for elementary properties of generalized Davenport transforms. 
It will serve here to establish the main contribution of the paper. For all practical 
intents, we remark that some results from Section 2, as basic as they are, will be used in 
the proof without explicit mention. This is especially the case of point (iii) of Lemma 
2.1 and point (i) of Lemma 2.2. 

Proof of Theorem 5. Since every semigroup embeds as a subsemigroup into any of its 
unitizations (unique up to a monoid isomorphism) , and any unitization of a cancellative 
semigroup is cancellative in its own right, there is no loss of generality in assuming, as 
we do in the sequel in the light of Lemma 2.5 and Definition 1, that A is unital. 

Thus, suppose by contradiction that the theorem is false. Then, there must exist at 
least one pair [X, Y) of subsets of A for which |X + y| < Q{X, Y), whence 

(4) 2 < |xj,|y| < 00. 

In fact, if one of X or y is empty then |X + y | = 0, while if both X and Y are non-empty 
but one of them is a singleton or infinite then |X + y| = max(|X|, \ Y\). In both cases. 
Definition 1 gives that |X + y| = Q{X,Y), contradicting our assumptions. It follows 
from (2) and (4) that 

(5) |X + y|< sup min ord(y-yo), |X + y| < |X| + |y| - 2. 

J/oGyx y&Y\{yo} 

Again without loss of generality, we also assume that \X\ + \ Y\ is minimal among the 
pairs of subsets of A for which (4) and (5) are presumed to hold. Now, since \X + Y\ 
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is finite by (4) and point (i) of Lemma 2.3, one gets by (5) that is non-empty and 
tliere exists yo such that 

(6) |X + y| < min ord(y — yg)- 

y&y\{yo} 

So letting Oa denote the identity of A and taking Wq := Y — yo imply by (5) and (6) 
that 

(7) \X + Wo\< min ord^, \X + Wo\ < \X\ + \Wo\ - 2, 

weWo\{OA} 

as on the one hand |y — yo| = \Y\ and + y — yo| = |^ + ^| by point (ii) of Lemma 2.6, 
and on the other hand, y € y\{yo} only if y— yo € (y — yo)\{OA} and w € (y— yo)\{Oyi} 
only if w + yo (z Y \ {yo} (see also Remark 4). We claim that 

(8) Z:={X + 2Wo) \{X + Wo) + 0. 

For suppose the contrary. Then, X + Wo = X + (VFo)a by Lemma 2.4, with the result 
that 

\X + Wo\ = |X + (Wo>a| > KVFo)a! > max ord(i(;) > min ord(t(;), 

where we use, in particular, point (ii) of Lemma 2.2 for the first inequality and the fact 
that |VFo| ^ 2 for the last one. This is however in contradiction to (7), and hence (8) is 
proved. 

Pick z ^ Z and let (X, VFg) be a generalized Davenport transform of (X, l^o) relative 
to z. As y generates a commutative subsemigroup of A (by hypothesis), the same 
holds true with Wo^ by point (iii) of Lemma 2.6. Moreover, 0^ S Wq (essentially by 
construction), and thus 

(9) Oa G T^o / 0, ^0 £ ^0, 

when taking into account Remark 5 and point (i) of Proposition 3.1. As a consequence, 
point (v) of the same Proposition 3.1 yields, together with (7), that 

\X + W'o\ + \Wo\ < |X + Wo\ + < 1^1 + iW^ol - 2 + \W'o\, 
which ultimately means, since \Wo\ = |y — yo| = |y j < oo by (4) and the above, that 

(10) \X^Wq\<\X\^\W'q\-2. 

It follows from (9) that 1 < IVFqI < l^ol) and indeed \W'o\ > 2, as otherwise (10) 
would give that |X| = \X + Wq\ < \X\ — 1, which is absurd since \X\ < cx) by (4). To 
summarize, 

(11) 2 < \Wo\ < \Wo\ < oo. 
Furthermore, (7) and (9) entail that 

(12) \X + W^\ <\X + Wo\ < min ord(u;), 

w&W^\{Oa} 
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where we use the elementary property that min(Ci) < min(C2) if Ci and C2 are sets of 
cardinal numbers with C2 C Ci. Now, since Oa G Wg"", then (12) implies that 

(13) \X + Wq\< sup min oTd{w), 

which gives, together with (4), (10) and (11), that |X + Wq| < Q{X,Wq), contradicting 
the minimality of \X\ + |y| in that \Wq\ < \Wo\ = \Y\, and hence \X\ + \Wq\ < 

|x| + |y|. ■ 

5. Some consequences 

Now we prove a series of (rather immediate) corollaries, the first of which will confirm 
that Theorem 5 is actually both a generalization of Theorem 1 and a strengthening of 
Theorem 2, as already mentioned by the end of the introduction. We start with two 
simple lemmas. 

Lemma 5.1. Let A = (A, +) be a cancellative unital semigroup and Z a non-empty 
subset of A such that ^ 0. Then sup^^g^x Tiim.z<^z\{zQ] ord(z — zq) ^ vi.^- 

Proof. Pick zq Z^ using that 7^ 0. If Z is a singleton, the assertion is trivial 
since then min^g^^j^g} ord(z — zq) = 00. Otherwise, let z £ Z \ {zq}. Then, by the very 
definition of p(A), one has ord(2; — zq) > p{A), which is clearly enough to complete the 
proof. ■ 

Remark 6. In the case of groups. Lemma 5.1 obviously applies to any non-empty 
subset. 

Lemma 5.2. Let A = {A,+) be a cancellative semigroup and X,Y non-empty subsets 
of A such that V / 0. Then n{X,Y) > min(p(A), \X\ + |y| - 1). 

Proof If (at least) one of X or y is infinite, the assertion is trivial, in that il.{X, Y) = 
\X\ + \Y\ — 1 = max(|X|, |y|). Otherwise, it follows from Lemma (5.1). ■ 

Corollary 5.1. Let A = (A, +) be a cancellative semigroup andX,Y non-empty subsets 
of A such that {Y)a is commutative and Y^ ^ 0. Then \X + Y\ > min(p(A), \X\ + 
\Y\-l). 

Proof. It is immediate by Lemma 5.2 and Theorem 5. I 

Remark 7. Corollary 5.1 contains Theorem 2 (and hence Theorem 1) as a special case. 

The bound in Theorem 5 can be slightly strengthened in the case where both sum- 
mands generate commutative subsemigroups. 

Corollary 5.2. Let A = (A, +) be a cancellative semigroup and X, Y subsets of A such 
that is commutative. Then \X + Y\ > Q{Y,X). 

Proof. It is straightforward from point (i) of Lemma 2.1 and Theorem 5. I 

Corollary 5.3. Let A = {A,-\-) be a cancellative semigroup and X,Y subsets of A such 
that and (y)A 0,1"^ both commutative. Then \X -\-Y\ > ma,x{Q{X,Y),Q{Y,X)). 
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Proof. It is a trivial consequence of Corollary 5.2 and Theorem 5. I 

Theorem 5 also allows for an alternative proof of I. Chowla's theorem for composite 
moduli [4]. Here, as in the statement of Theorem 6, we continue identifying a residue 
class with any of its representatives wherever it may be necessary. 

Corollary 5.4. Let A = {A, +) be a cancellative unital semigroup with identity Oa o,nd 
X,Y non-empty subsets of A such that (1")a is commutative and Oa € Y. Then 



\X + Y\> min 



I min ord(w), IXI + \Y\ — 11. 



Proof. If one of X or y is infinite, the claim is trivial, since then 

\X + Y\= max(|X|, \Y\) = \X\ + |y| - 1. 
In all other cases, Theorem 5 implies that |X + y| > mm{uj{Y), \X\ + |y| — 1), where 

uj{Y) := sup min > min ord(y), 

yo&Yxy^y\{yQ} y(^y\{OA} 

because Oa &Y^. Clearly, this suffices to complete the proof. ■ 

Corollary 5.5. Let A = (A, +, — , 0) be a group and X, Y non-empty subsets of A such 
that {Y)j^ is commutative. Then, \X + Y\ > mm{uj{Y), \X\ + |y| — 1), where 

uj{Y) = sup min ord(y — yo), 

yoeY y<^Y\{yo} 

and indeed u}{Y) = maxy^^y '^T^^^yeY\{yo} ord(y — yo) if ^ is finite. 

Proof. It is immediate from Theorem 5 since on the one hand A being a group implies 
Y = Y^ , and on the other a supremum over a finite set is a maximum (cf. Remark 
1). ■ 

Theorem 7 (Chowla's theorem for composite moduli []]). Let m € N~*" and denote by 
A the additive group (Z/mZ,-\-,—,Om) of the integers modulo m. IfX,Y are subsets of 
A with Orn ^Y and gcd(m, y) = 1 for every y ^Y\ {0^}, then \X + Y\ > min(m, \X\ + 
\Y\-l). 

Proof. It is straightforward from Corollary 5.5 since ord(y) = m for each non-zero 
residue class y £Y, using that gcd(m, y) = 1 by hypothesis. ■ 

And here is the proof of our generalization of Chowla's result: 

Proof of Theorem 6. As a matter of fact, A is a commutative finite group, for which one 
has especially that ord(z — zq) = m/ gcd(m, z — zq) for all z, zq € A. Thus, Corollaries 
5.3 and 5.5 entail that |X + y| > min(a;(y), \X\ + \Y\ - 1), where 

uj(Y) = max min ord(y — yo) = m ■ max min ^ = 6~^m. 

yoeY yeY\{yo} yoeY yeY\{yo} gcd(m, y - yo) ^ 

Now in an entirely similar way, it is found, in the light of Corollary 5.2, that 

\X + Y\> mm{5~^m, \X\ + \Y\ - 1), 
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which is clearly enough to complete the proof by considering that 6y = 1 (respectively, 
dx = 1) if there exists i/q (respectively, xq G X) such that m is coprime with y — yo 
(respectively, with x—xq) for every y € Y\{yQ} (respectively, for every x G X\{a;o}). ■ 

Obviously, Theorem 6 contains Theorem 7 as a special case. 



6. Distinct representatives 

As noticed by 0. J. R0dseth in [26, §6], Davenport's paper [6] begins on the very 
same page of the 1935 volume of Journal of the London Mathematical Society which 
marks the end of the original article by P. Hall containing his well-known theorem about 
distinct representatives [13], herein referred to simply as Hall's theorem: 

Hall's theorem. For k G N"*" let Si, S2, ■ ■ ■ , Sk be sets. There then exist (pairwise) 
distinct elements si, S2, ■ ■ ■ , Sk such that Si G Si and Si 7^ Sj for i ^ j, if and only if the 
union of any h of these sets contains at least h elements for every h = 1,2, ... ,k. 

Following [2(), §6], we use Hall's theorem to say something on "how to spot", by an 
algorithmic procedure, some of the elements of a sum-set. More precisely, suppose that 
A = (A, +) is a cancellative semigroup and let X, Y be non-empty finite subsets of A 
such that 



(14) 



\X + Y\< uj{Y) :-- 



max min ord(?/ 

yo&Y>^ yeY\{yo} 



yo) 



Clearly, (14) implies that Y^ is non-empty. Now, define k := \X\ and £ := \Y\, and 
denote by xi,X2, . . . ,Xk & numbering of X and hy yi,y2, ... ,yi & numbering of Y . Then, 
consider the following matrix: 



(15) 



a{X,Y) :-- 



xi + yi xi + y2 ... xi + ye 

X2 + yi X2 + y2 ■ ■ ■ X2 + y£ 

xk + yi Xk + y2 ■■■ xk+yi 



Any element of X + Y appears in some row or column of a{X,Y), and viceversa any 
entry of a{X, Y) is an element X+Y. Also, Theorem 5, Remark 1 and our hypotheses 
give |X -|- y I > k + i — 1. So it is natural to ask whether it is possible to argue anything 
about the actual positions where in the rectangular array (15) one can find k + i — 1 
(distinct) elements of Indeed, based on this notation, we can prove the following 

proposition, whose proof is almost identical to the one of [20, §6] (since it is short, we 
include it here for the sake of exposition): 

Proposition 6.1. Assume that {Y)^ is commutative and let Z be any subset ofX + Y 
of size £ — 1, for instance Z = xi + {yi,y2, ■ ■ ■ , ye-i}- Then we can choose one element 
from each row of a{X,Y) so that Z and these elements form a subset ofX + Y of size 
k + £-l. 

Proof. For each i let Zj := (x, + Y) \ Z and note that Zi is a set of elements in the 
i-th row of a{X,Y). Then, for all h G N"*" and pairwise distinct indices ii,i2, ■ ■ ■ ,ih & 
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{1,2,..., k} one has Zj^ U Zi^ U • • • U Zj^ = ({xj^ , a^jj, . . . , Xi^} +Y)\Z, with the result 
that 

\Zi, UZi^U---UZiJ> \{xi,,Xi^,...,Xi^} + Y\ -\Z\ = h + £-l-{£-l) = h, 

thanks to Theorem 5 and the fact that \{xi-^, Xi2, . . . } + Y\ < \X + Y\ < uj{Y) by 
point (iii) of Lemma 2.1 and (14). It follows from Hall's theorem that we can find k 
distinct elements, one from each Zi. Together with the i—1 elements of Z, this provides 
a total amount of k + i — 1 elements of X + y since, by construction, Z fl = for 
every i. ■ 



7. Examples 

We conclude with a couple of representative examples showing that Theorem 5 can 
actually be greatly sharper than Theorem 2. Both of them are concerned with groups. 

Example 1. Pick € N"*" with q prime. Define m := qk, and let A denote the 
additive group (Z/mZ, +, —,0m) of the integers modulo m. Consider the subset 

X :={!+ ik:i = 0,1,..., q-1} 

of A. Then, 2X = {2 + ik : i = 0, 1, . . . , 2g - 2} and n{X, X) = q, while p{A) is 
the smallest natural prime p dividing m. It follows that p(A) is (much) smaller than 
i}{X,X) if p is (much) smaller than q, while \2X\ = g + 1 in all cases. Now taking 
the direct (group) product of many copies of A, or more generally considering products 
relative to different moduli, yields other significant examples in the same spirit. 

Example 2. Let A be a non-empty alphabet consisting of two or more letters. Pick 
zq A and denote by A the (multiplicatively-written) group with presentation {A \ 
Zq = l)Grp (adding more relations in the presentation of A yields a variety of similar 
examples). Let X,Y be non-empty finite subsets of A with Y C for some z G 
A \ {zq}. Then, (Y)^ is clearly commutative and 

sup min ord(y — yo) = \^\-, 

to the effect that |X + y| > \X\ + \Y\ — 1 by Corollary 5.5. On the other hand, one has 
that min(p(A), |X| + |y| — 1) = p(A) = 2 whenever |X| + \Y\ > 3, with the result that 
the estimate on \X + Y\ provided by Theorem 2 is far too pessimistic, and definitely 
worse than the one furnished by Theorem 5, as soon as \X\ + \Y\ is "large". 
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